Likelihood decision boundary estimation between HMM pairs in speech recognition

نویسندگان

  • Levent M. Arslan
  • John H. L. Hansen
چکیده

In maximum likelihood estimation of hidden Markov models for speech recognition, the criterion is to maximize the total probability across the training data for a particular speech unit, such as a word, monophone, diphone, or triphone. Since each unit model is trained separately, such a strategy can often lead to biases among decision boundaries of the generated model set. In this paper, we propose a new technique to minimize the total number of misclassiications in the training data set by adjusting the decision boundaries between hidden Markov model pairs. The proposed algorithm is shown to reduce the error rate in a number of speech recognition tasks such as accent detection, language identiication, and confusable word pair discrimination. The technique is also attractive because it is simple to implement and the improvement in performance is achieved without any added complexity in the decoding phase.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature dimension reduction using reduced-rank maximum likelihood estimation for hidden Markov models

This paper presents a new method of feature dimension reduction in hidden Markov modeling (HMM) for speech recognition. The key idea is to apply reduced rank maximum likelihood estimation in the M-step of the usual Baum-Welch algorithm for estimating HMM parameters such that the estimates of the Gaussian distribution parameters are restricted in a sub-space of reduced dimensionality. There are ...

متن کامل

Dictionary-based discriminative HMM parameter estimation for continuous speech recognition systems

The estimation of the HMM parameters has always been a major issue in the design of speech recognition systems. Discriminative objectives like Maximum Mutual Information (MMI) or Minimum Classi cation Error (MCE) have proved to be superior over the common Maximum Likelihood Estimation (MLE) in cases where a robust estimation of the probabilistic density functions (pdfs) is not possible. The det...

متن کامل

An improved training algorithm in HMM-based speech recognition

In HMM-based speech recognition, estimation of parameters of HMMs is viewed as counterpart of training or learning in traditional sequential pattern recognition since speech signal can be represented by a sequence of ndimension vectors after features are extracted from the speech signal. However, due to variation of duration of the phone with speakers and context and its randomness, speech samp...

متن کامل

Predictive hidden Markov model selection for decision tree state tying

This paper presents a novel predictive information criterion (PIC) for hidden Markov model (HMM) selection. The PIC criterion is exploited to select the best HMMs, which provide the largest prediction information for generalization of future data. When the randomness of HMM parameters is expressed by a product of conjugate prior densities, the prediction information is derived without integral ...

متن کامل

Maximum likelihood noise HMMm estimation in model-based robust speech recognition

This paper presents a generalization of Rose's Integrated Parametric Model to the gaussian mixture hidden Markov model (HMM), formulation. Observations from clean speech HMM and noise HMM models are combined in the log spectra domain, through a corruption function, to generate noisy speech observations. In order to recognize noisy speech with the proposed model, when only the clean speech HMM a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 6  شماره 

صفحات  -

تاریخ انتشار 1998